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ABSTRACT. This note is a brief survey of some results of the recent collabo¬ 
ration of neurobiologists and mathematicians dedicated to stimulus reconstruction 
from neuronal spiking activity. This collaboration, in particular, led to the con¬ 
sideration of binary codes used by brain for encoding a stimuli domain such as a 
rodent’s territory through the combinatorics of its covering by local neighborhoods. 

The survey is addressed to mathematicians (cf. [DeSchOl]) and focusses on the 
idea that stimuli spaces are represented by the relevant neural codes as simplicial 
sets and thus encode say, the homotopy type of space if local neighbourhoods are 
convex (see [CuR08], [CuItVCYol3], [Yolf], [SiGh07]). 

1. Introduction and summary. In this note, a code C in general means a set 
of words in a finite alphabet. Finite seqnences of code words encode certain data, 
and mathematical problems related to codes depend on the class of sitnations in 
which encoding/decoding of the relevant data occnrs. For the restricted pnrposes 
of this introduction, we will divide these situations and the relevant problems into 
three groups. 

(i) Godes and encoding/decoding procedures in cryptography mast efficiently work 
in those situations where there is a challenge of unauthorised access to and/or 
falsihcation of the encoded information, cf. [YaOO]. 

In the relevant group of mathematical problems, encoding/decoding programs 
must be computationally easy, but the choice of the actually used program by a 
pair (or a larger group) of users must itself be encoded in such a way, that guess¬ 
ing it becomes computationally unfeasible for an unauthorised person/computer 
intervention. 

Since the notion of computational feasibility itself became formalised in a highly 
non-trivial way, in these studies the P/NP problem is often invoked. 
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(ii) Error-correcting codes must be constructed in such a way that when the 
information encoded by a sequence of code words is trans mi tted through a noisy 
channel, decoding algorithms could reconstruct the initial text from its corrupted 
version, at least with high probability. M. Tsfasman and S. Vladut were among the 
pioneers of the creation and study of efficient algebraic geometric error-correcting 
codes and coauthored an influential monograph about them that exists now in 
several editions and translations, cf. [VlaNoTsfaO?]. 

Interesting unsolved mathematical problems in the theory of error-correcting 
codes are stated in terms of properties of the set of all (unstructured or structured) 
codes of arbitrary length with a hxed cardinality of the alphabet. The study of 
mutually contradictory requirements of high transmission speed, high probability 
of error-free correction, and computational feasibility of encoding/decoding proce¬ 
dures, led to the notions of asymptotic hound for codes and isolated codes (lying 
above the asymptotic bound): see [VlaNoTsfaO?]. A recent progress is related with 
the understanding that if the Kolmogorov complexity of the code is considered as 
its energy, the standard models of statistical physics furnish an interpretation of 
the asymptotic bound as a phase transition curve: see [Mall] and [MaMarl2]. 

(iii) Finally, neural codes io which this short survey is dedicated are mathematical 
models suggested for the understanding how brain copes with multiple tasks of 
orienting and navigating in the world, cf. [CuItOS], [CuItVCYol3], [Yol4], [Giltl4], 
and references therein. 

The starting point of discussion is the following remark: external stimuli, say 
the spatial environment in which an animal moves, is accessible for (or even created 
by) a scientist. However, the brain of the animal must be able to reconstruct, say, 
a map of this environment and the current position in it using only the action 
potentials (spikes) of the relevant cell groups. 

The philosophy underlying construction of relevant mathematical models can be 
summarised as follows. In laboratory experiments designed to study brain functions 
it is found that stimuli are naturally divided into groups, and with each group a 
certain type of neural activity is associated. 

In [CuItOS] and [Yol4], it is postulated that a given type of stimuli can be 
modelled via a topological, or metric stimuli space X. Furthermore, brain reaction 
to a point in X is modelled by spiking activity of certain hnite set of neurons A/x- 

For example, whenever a rat in laboratory is moving in a restricted space, with 
possible obstacles, Y is a map of accessible territory, whereas regions in which the 
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animal finds himself, are marked by specific activities of hippocampal place cells: 
these cells constitnte elements of A/x- 

In an nnrelated stndy ([MaGaJoGoAshFraFriOO]) of licensed London taxi drivers 
who had to pass an intense two-year training for orientation in (the mental map 
of) London, it was fonnd that the total mass of hippocampal place cells is consid¬ 
erably larger that in the control gronp. (The anthors worked with data from the 
pre-GPS tracker’s era). This stndy is a strong argnment for the assnmption that 
hippocampns carries orientation/navigation nenral networks in hnmans as well as 
in rodents. 

Simplifying and averaging observable spiking data obtained from nenrons in A/x 
at varions points in X, the researchers ([GnItOS], [GnItVGYol3], [Yol4]) postnlate 
the following strnctnres. 

(a) The stimnli space X is endowed with a certain covering {Ui} by snbsets 
called receptive fields. 

(b) Possible patterns of spiking activities correspond to words belonging to the 
respective binary nenral code. Each word in snch code is a map w : Nx —t {0,1} 
or eqnivalently, the snbset rc“^(l) C A/x consisting of nenrons that hre at certain 
points of X whereas the remaining nenrons are inactive. 

(c) The correspondence between certain snbsets of positions in the stimnli space 
and patterns of nenral activity is given by the formnla (1) below. 

(d) Dnring the period of time when an animal stndies his territory, (an appropri¬ 
ate gronp of cells in) his brain generates the code reflecting a map of it. When this 
stndy evolves to the stage of active life in this territory, seqnences of code words 
encode paths in the territory. 

The main problem which is addressed in [GnItOS] and [Yol4] is this: what prop¬ 
erties of X can he reconstructed if one knows only all occurring patterns of neuronal 
activities, that is mathematically, the binary code C := Cx C {0,1}"'? Here and 
below we £x an X and nnmber all nenrons in A/x by (1,..., n} in order to simplify 
notation. 

It tnrns ont, in particnlar, that nnder some additional restrictions npon X and 
{Ui}, the total homotopy type of X can be inferred from the relevant code C. In 
particnlar, hnite seqnences of words in C may be nsed in order to model elements of 
the fundamental groupoid of X: accessible paths from one neighbonrhood in X to 
another one. (Finding snch paths of minimal length nsed to be a constant challenge 
for brains of taxi drivers.) 
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The suggestion that brain uses and somehow encodes the combinatorics of cov¬ 
erings (and more generally, diagrams in (poly)categories) was made already in 
[BrPo03]. 

Plan of the paper. In the section 2, I explain basic geometric structures relat¬ 
ing binary codes to topological objects that model stimuli spaces, and algebraic- 
geometric structures useful for description of codes themselves. The section 3 con¬ 
tains statements of results from [Yol4] showing how neural codes describe homo- 
topy types of stimuli spaces. Finally in section 4, I discuss possible versions of this 
approach for psychological and semiotic studies of languages. 

2. Binary codes and related geometric objects. Let now C be a binary 
code as above. Dehne the support of a code word w = (si,..., Sn) as supp (w) := 
{i \ei = 1} C {1, ... ,n}. The correspondence between C and set of all supports of 
words in C is a bijection, and we will sometimes identify C with the relevant subset 
of subsets in {1,..., n}. 

The binary code C is called simplicial one if for each w E C and each v G {0,1}"^ 
with supp (n) C supp(u>), we have v E C. Clearly, arbitrary binary code C is 
contained in the unique minimal simplicial code which we will denote C. 

Following [Yol4], we will consider three types of geometric objects related to 
binary codes. 

(i) Simplicial set of a code. The set of (supports) of all words of a simplicial 
code C has the natural structure of a simplicial set (see e. g. [GeMa03], sec. 1.2). 
To be more precise, this set of code words dehnes a triangulated space ([GeMa03], 
sec I.l) but the difference between the two notions is inessential in our context. 

We denote the respective topological space A(C). 

If the code C is not simplicial, we put by dehnition A(C) := A(C). 

(ii) Code of a covering. Let Y be a set endowed with a hnite family of subsets 
U := {Ui \ i = 1,... ,n}. Dehne the binary code C{U) (of length n) of this family 
by the following condition ([Yol4], p. 9): 


w E C{U) iff 


n u ^ 


, iesupp {w) 


Lj^supp {w) 


( 1 ) 


We will usually assume that X — 
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(iii) Algebraic geometric description of a binary code. Identify {0,1} with F 2 : 
finite field with two elements. Consider (0, 1}^ as the set of F 2 -points of coordi- 
natized n-dimensional affine space over F 2 . 

Any polynomial P G F 2 [a:i,... ,Xn] determines a code Cp: this code consists of 
all points of F 2 at which P vanishes. 

Call P a reduced polynomial^ if each monomial with non-zero coefficient in P is 
a prodnct of pairwise distinct variables Xi. 

An easy connt (see e. g. [Ma99]) shows that the map P ^ Cp determines a 
bijection between rednced polynomials and binary codes of length n. In other 
words, each code C consists of F 2 -points of a nniqne hypersnrface determined by a 
rednced polynomial over F 2 . The total ideal of polynomials vanishing at all points 
of Cp is generated by P and f = 1,..., n. 

Snch and similar representations of a binary code are nsed in [Yol4] in order to 
devise algorithms for extracting geometric information from nenral codes related 
to stimnli spaces. 

3. Encoding topology of receptive fields. Given a covering U : X = VJUi 
as above, we can constrnct its nerve: the triangnlated space N(U), whose vertices 
Ui bijectively correspond to [/*, whereas 

A; + 1 vertices Ui^,..., Ui^ span a /c — simplex iff n^=o 7^ 0- (2) 

3.1. Proposition. ([Ha02], p. 459, Corollary 4G.3.) Assume that X is a 
paracompact topological space, U its open covering, and all non-empty finite inter¬ 
sections ofUi are contractible. Then homotopy types of X andNiU) are canonically 
equivalent. 

Notice that the contractibility assnmption on Ufs in the orientation problems 
models the fact that visnal held at any point of space is covered by the direct lines 
of vision in all directions. 

3.2. Proposition. ([Yol4], p. 13.) With the same assumptions as above, N(U) 
can be canonically identified with the triangulated space X{C{U). 

Therefore, the total homotopy type of X is encoded byC. 

A check of the statement N{U) = X{C{U)) is omitted in [Yol4], bnt the anthor 
has kindly sent me an easy combinatorial argnment. I will reprodnce it here in 
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order to register that it does not use any topological structure of X: we may simply 
assume that X is a set represented as a union of its subsets Ui to which we apply 
dehnitions (1) and (2). 

The inclusion A{C{U)) C N{U) follows from the fact that if a G A(C(W)), then 
there is w E C{U) whose support contains n, so that applying (1) to w we get (2). 

Conversely, N{U) C A{C{U)). In fact, consider a G N(U). Choose p G Tii^cjUi: 
it exists in view of (2). Put r = {j G n}|p G Uj}. Then a C r and 

applying (1) we see that a belongs to the simplicial completion of C{U) and hence 
0- G A(C(W)). 

Applying Proposition 3.2 to receptive helds, N. E. Youngs in addition assumes 
that the relevant stimuli space X forms a domain in d-dimensional real affine space, 
and that all non-empty intersections of Ui are convex. This assumption is also 
motivated by the orientation experiments, mentioned in sec. 1 above. She shows 
then that additional properties of X can be extracted from C{U), in particular, 
a lower bound for the embedding dimension d and an information about “non- 
obvious” relations between Ui of the form 

Eii^ijUi C Uj^T-Uj for certain cr, r with cr fl r = 0 

([Yol4], p. 15). Algorithms for extracting this information are given in terms of 
polynomial generators of the ideal in F 2 [a;i,... ,Xn] dehning the code. 

4. Neural codes, semiotics, and modes of mathematical thinking. A 

code, as we dehned it in the Introduction, is an analog of dictionary in linguistics. It 
is the list of minimal units of a language. Whereas theory of, say, error-correcting 
codes does not invoke any a priori semantic notions, semantics of neural codes was 
our main preoccupation here. (Cf. similarly motivated discussion of genomics in 
[Ge93] and Svante Paabo’s diagnosis in his book “Neanderthal Man: In Search 
of Lost Genomes”: “The dirty little secret of genomics is that we still know next 
to nothing about how a genome translates into the particularities of a living and 
breathing individual. ”) 

Furthermore, the main problem addressed in [GuItOS], [GuItVGYol3], [Yol4] 
can be compared to the decompilation in computer science: translating a ma¬ 
chine/assembly language code back into a higher level language code. In semiotic 
parlance this means that spiking activity plays the role of signifier whereas the 
nerve of the covering is the signified referring to a stimuli space, its covering, and 
individual positions in the stimuli space. 
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Below I suggest that (a fragment of) the mathematical language used in this 
description of the signiher of neural code might have a wider applicability. I will 
focus on the example of “semantic space”, discussed in [Man08], sec. 2. In most 
variations, one starts with the assumption that “the space of meanings” (say, of 
words in a language) is a certain set X. The next step consists in postulating 
additional structures (in Bourbaki’s sense) on it. 

(a) The most straightforward assumption is that meanings form subsets Ui rather 
than points of X so that we can imagine them as a covering even before (or without) 
postulating a topology. 

For example, P. Guiraud ([Gui68]) suggests that subsets of meaning Ui have a 
natural embedding into affine spaces R'^, whose axes are marked by “semes”. Each 
seme axis corresponds to a semantic opposition (in the structuralist paradigm), 
such as “animate/inanimate”, so that a pure ” yes/no” picture would provide only 
an embedding into {—or else {0,1}'^. However, a more realistic description 
of meanings would allow less localised positions on the seme’s axes. Say, on the 
axis “animate/inanimate”, what place should be assigned to viruses? 

(b) Discussing Zipf’s Law, D. Manin in [Man08] postulates furthermore that X 
is endowed with a measure, and that Ui are its measurable subsets. He uses this 
structure in order to provide a mechanism generating Zipf’s Law. 

On the other hand, in the generating model of Zipf’s Law developed in [Mal3] 
the key role is played by the Kolmogorov complexity of the hypothetical neural 
encoding of the semantic space rather than any specihc structure of this space itself. 
It is implicitly suggested that such encodings are combinatorial objects, exactly as 
in stimuli spaces encodings considered above. Therefore it would be extremely 
interesting to study neural encodings of human languages from the perspective of 
[Gult08] and [GuItVGYol3]. 

(c) My last remark concerns a noteworthy parallel development in contemporary 
foundations of mathematics: Voevodsky’s “univalent foundations”, cf. [HTT13] and 
a brief introduction in [PeWal2]. Roughly speaking, in this model “continuous” 
(homotopy type theory whose logical description involves simplicial sets, exactly as 
in the models of neural codes) belongs to the level of foundations, whereas “discrete” 
(set theory) is a superstructure. During the whole XXth century, from the times 
of Georg Gantor and Felix Hausdorff, these two modes of mathematical thinking 
were developing in the reverse order, from discrete to continuous: cf. Figure 3 on 
p. 10 of [PeWal2]. 



8 


This probably justifies the decompilation metaphor and the image of “Founda¬ 
tions as Superstructure” suggested in [Mal2]. 

Acknowledgements. Preparing this survey, I benefited from e-mail exchanges 
with Nora Esther Youngs, Carina Curto, Vladimir Itskov, Dmitri Manin, Ronald 
Brown. 
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